More about Base Station Location Games 



Frangois Meriaux, Samson Lasaulce, Michel Kieffer 

L2S - CNRS - SUPELEC - Univ Paris-Sud 
3 rue Joliot-Curie F-91192 Gif-sur-Yvette, France 
{meriaux,lasaulce,kieffer}@lss. supelec.fr 

January 20, 2013 



O 



C/3 



> 

(N 
O 



> 
• ^ 

X 
S3 



Abstract 

This paper addresses the problem of locating base stations in 
a certain area which is highly populated by mobile stations; 
each mobile station is assumed to select the closest base sta- 
tion. Base stations are modeled by players who choose their 
best location for maximizing their uplink throughput. The 
approach of this paper is to make some simplifying assump- 
tions in order to get interpretable analytical results and in- 
sights to the problem under study. Specifically, a relatively 
complete Nash equilibrium (NE) analysis is conducted (ex- 
istence, uniqueness, determination, and efficiency). Then, 
assuming that the base station location can be adjusted 
dynamically, the best-response dynamics and reinforcement 
learning algorithm are applied, discussed, and illustrated 
through numerical results. 

1 Introduction 

Mobile terminals (MTs) are currently gaining increased au- 
tonomy of decision to allow a better use of the available wire- 
less resources. For example, MTs may choose their wireless 
access technology or the base station (BS) or access point 
to which they want to connect. We could imagine that this 
may be done in the future independently of the network op- 
erator owner of the BS. A mobile operator deploying BSs 
for a wireless network will have to deal with these new char- 
acteristics. If his goal is to maximize the traffic gathered 
by his own BSs, he will have to take into account the pres- 
ence of competitor network operators when deciding on the 
location of BSs. If every operator involved has the same rea- 
soning, this problem of BSs placement may be cast in the 
framework of game theory and more precisely in the context 
of location games. 

The history of location games starts with the work of 
Hotelling 8 in which the notion of spatial competition in a 
duopoly situation is introduced. More precisely, two firms 
compete for benefits over a finite segment crowded with cus- 
tomers. This results in the partition of the segment into a 
convex area of influence for each firm. Plastria [T^ gives 
an overview of optimization approaches to place new facil- 
ities in an environment with pre-existing facilities. A large 
overview on location games is also presented in [7]. Loca- 
tion games are extended to the context of wireless networks 
with works such as [T] and [2]. The main difference arising 



in this new context is the interaction between MTs due to 
the mutual interference they generate. This point makes the 
association problem between MTs and BSs complex. As an 
association between a MT and a BS depends on SINR, the 
association relies on the respective locations of the MT and 
the BS, but also on the MTs already connected to the BS. 
Whereas [2: focuses on the downlink case, in jl] the location 
of BSs and the association choice of the MTs is treated as a 
Stackelberg game [15] in the uplink case. The context of our 
work is similar to the one in [1] but several interesting results 
are obtained in the present paper. The main contributions 
of this paper can be summarized as follows: 

• As in [1] MTs are assumed to operate in the uplink and 
to be distributed along a one-dimensional region. How- 
ever, each MT is assumed to select the closest BS (e.g., 
based on measure given by a GPS -global positioning 
system- receiver). This leads to a convenient form for 



the BSs utility functions (Sec. 2.11. As a consequence 



the existence of a pure Nash equilibrium can be made 
rigorously (Sec. |3.l] ). 

Due to the symmetry of the problem, multiple Nash 
equilibria generally exist. However, if the locations can 
be ordered (which is easy for one-dimension regions), 
the Nash equilibrium can be determined and checked 



to be unique (Sec. 3.3 1 



By making the reasonable assumption that the BS 
heights are much less than the typical distance between 
the BSs, the game can be further simplified and shown 
to be a form of Cournot oligopoly [4j. 

In the two-player case, the efficiency of the Nash equi- 
librium is studied by evaluating the price of anarchy [9] . 
The influence of deploying its BS in the first place is 
studied by considering a Stackelberg formulation of the 



problem (Sec. 4.1 1 



Assuming that the BS locations can be adjusted dy- 
namically (which would be relevant in scenario like the 
one of small cells where only some of the small BS have 
to be active), the best response dynamics and rein- 
forcement learning algorithm [3] |14] [TT] are performed 
(Sec.[5|. 

The made assumptions lead to several interpretations 
which could be further analyzed in the light of a more 
general framework (e.g., in two-dimensional regions). 
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The remainder of the paper is organized as follows. Sec- 
tion [2] introduces the physical model and the parameters of 
the if -player game. Section [3] describes the Nash equilib- 
rium of the game in the one-dimensional case. In Section [Ij 
the Nash equilibrium, the Stackelberg equilibrium, and the 
social optimum are compared. Section [5] presents a way to 
reach equilibrium using best-response dynamics and rein- 
forcement learning. Finally, Section [6] concludes this work. 

2 Model 




Figure 1: Base station located at xi and mobile station 
located at x 



Consider a plane to which a frame R is attached. A MT X 
located in a position a; £ in this plane is linked with a 
BS X\ of height e situated in x\ e , see Figure [l] We 
define the Signal to Noise Ratio (SNR) and the Signal to 
Interference plus Noise Ration (SINR) of this MT 
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SINRx 



Px-hxAx) 
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^x,+Ix^{x)' 

where Px is the transmission power of the MT X, i.e. the 
level of power chosen by the MT to transmit its signal, ax^ 
is the power of the channel noise, Ixi (x) is some interference 
term, hxi (x) is the attenuation introduced by the uplink 
channel from X to Xi. Here, it is assumed that 



hxiix) = 



\x — xi\^ + e^) 



(3) 



where is the ^2-norm of x and a > 2 is the path-loss 
exponent, a — 2 corresponding to the free-space path-loss 
case. A higher value of a suits to worse channel conditions. 

With Single User Decoding (SUD) at the BS Xi, there is 
no hierarchy for decoding the incoming signals at the BS. 
Hence, the signal from MT X is decoded by taking into 
account the full interference and the uplink capacity between 
X and Xi may be written as 



Cx =loe{l + SINRx), 



(4) 



Without loss of generality, when several MTs are consid- 
ered, it is assumed that Px does not depend on the MT and 
is normalized, i.e., Px = 1. Moreover, the channel condi- 
tions, described by Q, are the same for every MT and the 
noise power is constant axi ~ o". With these assumptions, 
ffl becomes 







Cx = log 1 + 



\x — xi\'^ + e^) 
a^ + Ixd^) 



(5) 



When several BSs are located on the plane, the MTs are 
assumed to be able to choose the BS they want to be linked 
with. In this paper, we consider that this association is 
made based on SNR. Given ([3|, choosing the BS with the 
highest SNR is equivalent to choosing the closest BS. Thus, 
it is assumed that a MT always chooses the closest BS to 
its location, leading to convex cells for BSs. For example. 



considering two BSs Xi and X2 at positions Xi and X2 and 
a MT X at x, one has 



Cx 



1 + 



n \x — Xi\ ^ \x — X'zl 



1 / 1 I (|a:-a:2l +e ) ^ \ .„ i |2 ^ i |2 

log 1 + ^ ^2^7^ f J ] it \X - X2\ ^ \X - Xi\ 



2.1 Base station utility 

The utility of a BS is taken as the sum of uplink capacities 
it offers to its connected users. We assume that the number 
of MT is large enough to be represented by a continuous 
distribution p (x). This assumption allows to get the utility 
for the fc-th BS as a continuous sum of the MTs uplink 
capacities 



Uk{x) 



p{z) log 1 ^ ' ' 



Az, (6) 



where Sk (x) is the subset of the plane where MTs are linked 
with the fc-th BS and a; = *(a::i, x-i, ■ . . , xk) is the vector of 
locations for the set of BSs. This paper will consider only 
uniform MT distribution with p(x) — 1. 

When considering interferences, a worst-case scenario is 
considered: there is no mechanism such as beamforming [TO] 
to lower their effects. We consider that there is no interfer- 
ence between MTs connected with different BSs because of 
frequency reuse. Then only interference between MTs of a 
same BS has to be considered. This framework is quite sim- 
ilar to the one of [T], where two competing BSs are assumed 
to use different frequency bands. In our case, we consider K 
BSs (with K ^ 1) and each of them uses its own frequency 
band. 

Performing SUD at the fe-th BS, one gets 



(7) 



Mx) = / {\z-xkr+e') ^ dz. 
J Skim) 

Utility of fc-th BS (|6| then becomes 

{\z-Xk\^+e^)-^ 



log 1 + 



2.2 utility approximation 



dz. 



(8) 



When considering the low SINR regime, the useful power 
of each MT is small compared to the interference term in 
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3 Nash equilibrium analysis 

The aim of this section is to show the existence of a Nash 
equilibrium in the location game described in Section |2.3 
and to characterize this equilibrium. 



Figure 2: Two base stations competing in a one- 
dinicnsion space 



([sjl. This is especially true for high MT density. With this 
assumption, Q may be approximated as 



Uk{x) 



(9) 



Note that this simplification makes the considered utility 
based on capacity equivalent to a utility based on SINR such 
as the ones in Jj. At low-SINR regime, it is equivalent to 
work with a capacity-based utility or a SINR-based utility. 

Also note that one has f {t) = t/ (cr^ + strictly increas- 
ing over [0, L], since its derivative is /' (t) — a^/ {a^ + t)^ . 
Then maximizing the approximation of Uh{x) or Ik{x) is 
equivalent. Thus the utility we define for the game is 
Uk{x) = Ik{x), k £ K.. 

2.3 Definition of the game 

In this section, we study the case of K BSs competing on 
a segment of length L. Each of the BSs uses different car- 
rier frequencies so the MTs of different BSs do not interfere 
together. As assumed in Section [2] the MT distribution is 
uniform over the segment, and the set of possible locations 
for the BS is [0,L]. Figure [2] illustrates the context for the 
two-player case. 

Definition 1. The strategic form of the game is given by 
where 

• IC — {1,...,K} is the set of players, which are here 
BS. 

• {^k}k^ic of actions players can consider, here 

Ak^{xk€[0,L]\0<xi< ...< XK < L}. (10) 
Denote A = {xe [0, L]'^ \ < xi < . . . < xk < L} . 

• {C/fcjfegx; of utilities players use. 

Note that we are interested in location equilibria that 
do not superimpose several BSs. Thus, if there exists an 
equilibrium, there exists a spatial order for BSs at this equi- 
librium, that is why we introduce this order in the action 
spaces {AkjkeK.. 



3.1 Existence 

In this section, we focus of the existence of a Nash equilib- 
rium in the defined game. To prove the existence of a Nash 
equilibrium, the concavity of Uk{x) with respect to Xk over 
Ak, Vfc £ IC, has to be established. 

Lemma 1. Uk{x) is concave with respect to Xk over Ak, 
Vfc e JC. 



The proof of this lemma is in Appendix |A.1[ Then one 
has the following theorem. 

Theorem 1. In the game defined by Definition [7] there 
exists at least one Nash equilibrium. 

Proof. • Using Lemma [l] we know that Uk (x) is concave 
with respect to Xk over Ak, Vfc G IC, 

• Uk (x) is continuous with respect to x over A, Vfc G IC, 

• the set of feasible actions is compact and convex for all 
players in the game. 

The Rosen [TS] conditions for the existence of a Nash 
equilibrium are met and Theorem [l]is thus proved. □ 



3.2 Multiplicity of NE 

Regarding to the uniqueness of the Nash equilibrium, as the 
characteristics of the BSs (height and noise) are assumed 
to be identical, it is interesting to note that permuting the 
order of BSs leads to a symmetric system of equation. Thus, 



without condition on the order of BSs as in (10 1, there are 



K\ Nash equilibria for the game and all these equilibria are 
symmetric, meaning that the set of locations at equilibrium 
is unique. However, if one imposes the condition order ( 10 1, 



the NE can be shown to be unique by using the Diagonally 
Strict Concavity (DSC) condition [13j . 

Theorem 2. In the game defined by Definition [7] there 
exists one single Nash equilibrium. 



The proof of this theorem is in Appendix |A.2| Having 
uniqueness under this order condition might seem to be a 
weak result in comparison with a general uniqueness result. 
However, in the framework of a dynamic process, the initial 
locations of the base station might suffice to determine the 
effectively observed NE (after convergence). 



3.3 Determination of the NE 

A characterization of the equilibrium is provided in this sec- 
tion with examples for small values of if. A real solution x 
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Figure 3: Base Stations at Nasti equilibrium for a = 3 
(dotted line) and a = 2 (continuous line) . From the top 
to the bottom, configuration with 2 BSs, 3 BSs and 4 
BSs. Parameters are e = 0.1 and L = 100. 



for a > 2 has to satisfy 



Xi 



2\J (5.2 a -2 a -4)^2 ^2 a a;|-2Q X2 

2 ? 
4-2 c 

'k-1 

2 



-2\/(5.2J 



Xk=^-^±^P^, \/k£{2,...,K-l}, 

Xk = 

4-2 a 

(11) 

See Appendix |A.3| for detailed derivations. In the case q = 2, 
one obtains 



r XI = \/2e2 + 2x2 -a;2, 

I ^ ^.+1+^.-1 ^ vfce {2,...,a:-i}, 

I XK = -\/lOe2 + 2(L - XK-iY - XK-i + 2L. 
In the 2-player game with a = 2, one gets 
xT = L ~ IV2L2 - 4e2 



(12) 



(13) 



Figure [3] illustrates equilibria for a = 2 and a = 3 for 
respectively two, three, and four BSs. 



4 Comparing equilibria 

This section compares the Nash equilibrium described in 
Section [3] the Stackelberg equilibrium, and the social opti- 
mum. 

4.1 Stackelberg equilibrium 

As written in Section [S] the pure Nash equilibrium needs 
the players to know their spatial order to be reached. If 
players do not know this order but play in a chronological 
order, the problem changes. If the first player plays alone at 
the first stage of the game knowing that other players will 




Figure 4: 



and ^ 



C/l"^ — j^NB 



with regard to e. 



place their BSs after, it turns into a Stackelberg game [15] 
with the first player being the leader of the game. This idea 
is illustrated with a two-player game with one leader (BS 
1) and one follower (BS 2). The leader chooses its position 
knowing that the follower will place itself after. Both BSs 
still want to maximize their utilities and BS 1 knows this 
point. Then, BS 1 knows how BS 2 is going to be placed 
regarding to its own position, it is simply the best response 
of BS 2 



X2{xi) 



2a/(5.2= - 2« - 4)e2 + 2S(L - xi)2 - 2^;xi -I-4L 



4 - 2s 

Then BS 1 places itself at a location xi solution of 
dUl 



dxi 



(xi,X2(xi)) = 0. 



(14) 



With a — 2 and neglecting e compared to every other 
lengths of the problem, one gets 



Xi 



I-V2 + 



V2]L 



(15) 



and 



X2 



(^(V2~1){1 + ^J2-V2)^L. (16) 

On Figure[4] we compare the utilities of the leader and the 
follower of the Stackelberg game versus the utilities of the 
Nash equilibrium. As we see, the leader of the Stackelberg 
game has a better utility than what he would have get at 
the Nash equilibrium. On the contrary, the follower has a 
worst utility. Hence, in a mobile operator point of view, it 
is more interesting to deploy its BS first. 



4.2 Social optimum 

Sections [3] and [4T] provide equilibria corresponding to situ- 
ations where the BSs only consider what is best for them- 
selves. But these equilibria are not the best for the MTs. 
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Figure 5: Nash Equilibrium, Stackelberg Equilibrium, 
and Social Optimum for a = 2. 



In the IVETs point-of-view, the utility to consider is the com- 
plete utility sum or social utiUty. In a 2-BS case 



(7°° {xi,X2) = Ui {xi,X2) + U2 ixi,X2) ■ 



(17) 



For this utility, we know that there exists an optimum 
since the strategy set A is compact and Ui{xi,X2) + 
U2 {xi,X2) is continuous with respect to {xi,X2) over A. At 
this optimum, it is proven 1 that 

• (j) the BSs place themselves at the middle of their as- 
sociated subset of segment (respectively Si and 52), 

• (ii) the frontier between the two BSs is the middle of 
the segment. 



Thus, we have the optimum 

so L 

4 



X2 



3L 
4 ' 



(18) 



Figure [S] shows the Nash equilibrium, the Stackelberg 
equilibrium, and the social optimum for a — 2. The lo- 
cations of BSs for Nash equilibrium and social optimum are 
symmetric with respect to the middle of the segment [0, L] 
whereas this is not the case for the Stackelberg equilibrium. 



4.3 Price of anarchy 

To compare the Nash and the Stackelberg equilibrium in 
the two-player case, we look at the utilities at equilibrium. 
Without any pricing mechanism, the Nash equilibrium and 
the social optimum are very close in terms of locations. As a 
result, they are also close in terms of utility sum. The Price 
of Anarchy (PoA), introduced in [5], is an adequate metric 
to compare these sums 



PoA{eq) 



max^eAj2keicUk{x) 



(19) 



Note that the PoA is always stronger than 1 and if the 
PoA is high, it means that the corresponding equilibrium is 
not that efficient in terms of utility sum. On the contrary, 
if the Pol is close to one, the corresponding equilibrium 
is satisfying. Figure |6] illustrates the behavior of the PoA 
for the Nash equilibrium and the PoA for the Stackelberg 
equilibrium as a function of e for 2-player case. It appears 
that the Stackelberg equilibrium is less efficient than the 
Nash equilibrium. 
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Figure 6: Prices of Anarchy regarding to e. Parameters 
are L = 100, ct^ = lO'^, a = 2. 

5 Convergence to the Nash equi- 
Ubrium 



Getting to the equilibrium given in Section |3.3| is not sim- 
ple: it would require that every player knows the utilities 
of other players. In practice, this is hardly the case. Thus, 
we present techniques that enable to reach the Nash equi- 
librium in a decentralized way: best-response dynamics and 
reinforcement learning. However the BSs need to be movable 
to perform these two techniques, hence it is more accurate 
to talk about Mobile Stations (MSs) in the present section. 
Note that for these two techniques, the only assumption 
about MSs location is that they cannot superimpose. 

5.1 Best response dynamics 

The context of this section remains the same as defined in 
Section [231 

The principle of best-response dynamics is that given a 
realization of actions for its opponents, every player of the 
game is able to compare its own possible actions and choose 
which one is best for itself. Precisely the best-response al- 
gorithm is the following 

1. At every time step t, each MS k chooses its location 
x^{t) according to 

xf{t) = argmax tJu(xk,x_^.{t)). 

2. Algorithm stops when \x^'' {t + 1) - {t)\ < /3, with 
P fixed. 

Regarding to the convergence of this algorithm, we study 



the system of best-responses (111. If we neglect e regarding 



to xi and L — xk-i, one gets a linear system of equations 

l + a 2_ 



Xl = 
Xk = 
XK = 



4-2 c 
~k+l+- 



-X2, 



(4-2 



Vfc e {2, 

l + e» 



-XK-1- 



(20) 



It is a linear Cournot oligopoly HI. 

One distinguishes two ways this algorithm can run. 
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Simultaneous best response dynamics. At every step, ev- 
ery MS adapt their locations simultaneously. In this case, 
the evolution of the locations with the algorithm can be ex- 
pressed by 

x'"'{t + l) = a + AKx'"it) (21) 



with a= *(0, . . . ,0, 



l + Q 

(4-2 a )L 



4-2 e 

/O gia) 
i 



and 



(22) 



gia) 0/ 



with g{a) = 2—2 — ^-2-. 

4-2 Q 

As recalled in Lemma 2.8 of [161 . for an irreducible posi- 
tive square matrix A= (aij)i<i<n, then either 



^a,, =p(A)ViG [l,n], 



(23) 



iM y]««J <P(A)< sup Vfflij, (24) 

J = l J = l 

with p(A) being the radius of A. In our case, Ak can be 
verified to be positive and irreducible, then one has 



(a) < p(Aa') < 1. 



(25) 



Hence, the convergence of the algorithm is ensured. 

Sequential best response dynamics. The other case is the 
sequential algorithm: at every step, only one MS adapts its 
location. Depending on the MS k adapting its location, the 
algorithm evolves according to 



x'"{t+l) = a + A'kx'"{t), 



(26) 



with A^ corresponding to the adaptation of the fc-th MS. 
AJ^ may have several forms. 

• If the fc-th MS, k £ {2, . . . ,K — 1} adapts its location, 
then 



/I 







0\ 





I 
1 



In this case, A|- is irreducible and positive and 
piA'k) = 1. 



(27) 
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Figure 7: Best-responses of MS 1 and MS 2 for a = 2 



If the first MS adapts its location, the matrix has the 
form 

/O g{a) ...\ 



Ai. 











\0 ... 1/ 

Again, A}.^ is irreducible and positive, but this time 

g{a) < p{Ai-) < 1. (28) 

Note that if the K-th MS adapts its location, the same 
reasoning can be done. 



gia) < p(Af ) < 1. 



(29) 



Hence, pdTfeli ^k) = Ilfeli p(A-x) < 1 and the algorithm 
converges. 

In Figure [7[ we compare MSs 1 and 2 sequential best- 
responses for a = 2. 

5.2 Reinforcement Learning 

The following notations are used in this section. For a set 
of MSs K. = {1,. . . ,K}, let Ak = {vki, • ■ . , Vknik) be the set 
of possible locations for MS k [mk being the cardinality of 
Ak), which corresponds to a discretization of the set [0, L]. 

We implement a discrete stochastic learning algorithm in 
the sense that the action space is discrete and at every step, 
actions are chosen in a stochastic way. The only informa- 
tion available to a MS is the value of its utility after each 
iteration (note that the MS does not necessarily knows its 
utility expression). 

We define p^{t) = {pki{t), . . . ,pk,nkit)) the probability 
distribution vector of MS k at time t. 



P[xk{t) = pkr] ^Pk^it), i e {l,...,mfe}. (30) 
The algorithm used by each MS is then the following. 

1. Initialize the distribution probability vector. 
VfcG/C, ViG {l,...,m4, pfc.(0) = ^. 

2. At every step t, each MS k chooses a location Xk{t) 
according to its probability vector p^ (t) . 
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Iterations 



Figure 8: Evolution of action probabilities for BS 1 with 
discrete stochastic learning. 
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Figure 9: Evolution of action probabilities for BS 2 with 
discrete stochastic learning. 

3. Each MS gets Uk{t). 

4. Each MS updates its probability distribution vector 

Pki{t + 1) = Pki{t) - bUk{t)pki{t), if xt{t) / yki 
Pki{t + 1) = Pki{t) + hUk{t) Y^^^iPks{t), if Xk{t) = yfc, 

(31) 

5. Algorithm stops when p^{t + 1) = P^i't), else go to step 
2. 

Figures [S] and [9] illustrate the evolution of the probability 
distribution vectors of two MSs. The parameters of the sim- 
ulation are the following: each MS has the same set of possi- 
ble positions {10, 30, 50, 70, 90} and the step of the learning 
algorithm is 6 = 0.01. 

Depending on the choice of 6, two phenomena occur. 

• The higher the value of b, the lower the convergence 
time of the algorithm, 

• However, if b is chosen too high, the algorithm may 
converge to locations that do no correspond to an equi- 
librium of the game defined in Section [2. 3| 




Iterations 

Figure 10: Convergence of probability with respect to 
step b 

Figure [lO] illustrates the convergence time as a function 
of b. 

The choice of b is hence a trade-off between convergence 
time and accuracy of the convergence. 

6 Conclusion and perspectives 

Obviously the proposed model is relatively simple and 
should be improved to obtain refined results. However, this 
led us to several interesting results such as a full character- 
ization of Nash equilibria and interesting behavior in terms 
of convergence. It would be very relevant to extend this 
work to two-dimensional scenario, define a suitable order 
for which uniqueness would be ensured. More connections 
with the famous multi-source Weber problem [Bl should be 
established to better understand the general problem of de- 
ployment games. Indeed, if an operator has to locate a set of 
base stations, the problem becomes more complicated. The 
problem becomes even more interesting if the set of possi- 
ble constellations is discrete, which would lead us to make 
connections with Voronoi games [3. The authors believe 
there is large avenue for contributions to the general prob- 
lem under study, especially in finding relevant assumptions 
to simplify it without too loss in terms of understanding. 
Random matrix theory and stochastic geometry might be 
for great help to achieve this challenging objective. 
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A Proofs 

A.l Proof of Lemma [T] 

We need to prove that Uk{x) is concave with respect to Xk 
over Ak- 



We know that the index of players verify (10 1. In this 
case, the regions associated to the BSs are 

( [0, 24^] for BS 1, 

^ ^k-i+^k ^ ^k+^k+i ^ for BS k, ke {2,...,K -1} 



, L] for BS K. 



(32) 



Then 



lh{x)= !^-^{e^ + {x-x^f)-iAx 

Uk{x) = {e' + (x- Xkfyidx, fee {2,...,/C-l} 



Uk{x) = / i'j,_i+x^ (e^ + {x-XKf yidx 
2 

which may be rewritten as 

idx 



(33) 



U,(x)=!^[e^+x^r 
< Uk{x)^ f:^^JZ^ {e^+x^)-^Ax, k€{2,...,K-l} 
UK{x)=Sij^^{e'+x^)-^Ax 

(34) 

To prove the existence of a Nash equilibrium, the con- 
cavity of Uk {x) with respect to Xk ^k £ K, has now to be 
established. One has the first-order partial derivatives 



( aui , 



^(x) = -i(£^ + i^rr^ + + xiY 



gfe) = + C-^^fyi + \{e' + (f-izfi.)2)-f 



t '^{x)^^{e' + {L^XKY)~^ + W + { 
and the second-order partial derivatives 

dx'f 



1/^2 , /Xk-1~^k\2\ 



(35) 



a.i fe)--f 
< ^ix)-f{~ 



(^2 + (£2^)2) 2+1 
^k + l-^k 



(e2+:rf)2+l 



L)2)2- 



2^{x) = -a 



(e2 + (^£^i ^)2)2 

XK-l—^K 



) 



_ (36) 

Given ([lO|, we have ^^{x) < Vfc G /C. Thus Uk{x) is 



concave with respect to Xk over^fc, Vfc e IC. 



A.2 Proof of Theorem [2] 

In the context of A'-player game, the DSC [T^ condition 
writes V(a, a') G A'^ such that a' 



--)(||(.)- >0 (37) 



For clarity reasons, we denote 

g{a, b) = ( (e^ + a")" * - (e^ + 6^)'* ) , (a, b) G R\ (38) 
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By ( 35 1, it turns 



^1 -Q2 ^1 ^2 



■gsr(°;)-i5sr(!^ )=2 a( 5 ■ 2 



oa; ^ — ' ax, r. 



(39) 



Equation (371 becomes 



1 °l-°2 °1~°2 



^fc = 2 2 



)_g(L_aj^,L-a'^-) >0 



+ {a'K-ajf ) ^ 

which can also be written 
(a'l - ai)g(ai,a'i) 

+ 9 ^ 



(40) 



(41) 
(42) 



+ (L - Ok - (L - aK))g{L - ax, L - a'j^) > 
However V(a, b) G R*^ 

(b-a)g{a,b) > 
and by the order condition ( |10[ ) 

(a'i,ai) G R*2 

( °''-'",°''-^ , ""-""-^ ) G R*^ Vfc G {2, . . . , K} (43) 
[l- a'K,L- uk) GR*^ 

Then the DSC condition is verified and the equilibrium is 
unique. 



A. 3 Derivation of (11) 



To obtain a formal expression of a Nash equilibrium, the 
intersection of the best-responses has to be considered 



dUk 
dxk 



(x) = 0, Vfc G IC, Vq ^ 2. 



(44) 



031 leads to 



(2=(e2 + (2^)2))f = (£2+3;2)f 

(£2 + (L-:CK)')t = (2^(e2 + ( 



(45) 

Since all terms elevated at power a/2 are positive, one gets 



-1- 
2 



VfcG {2,...,^-!} (46) 



2f (£2 + ( -^-p^' )2) = £2 ^ _ ^^,)2 



One real solution x verifying ( 10 1 has thus to satisfy (111 



when a > 2 and ( 12 1 when a = 2 



